Goto

Collaborating Authors

 Banaadir



Putin rejects key parts of US peace plan as Kremlin official warns Europe faces new war risk: report

FOX News

This material may not be published, broadcast, rewritten, or redistributed. Quotes displayed in real-time or delayed by at least 15 minutes. Market data provided by Factset . Powered and implemented by FactSet Digital Solutions . Mutual Fund and ETF data provided by Refinitiv Lipper .


Diaspora Cookbooks Hit Their Heyday

WIRED

Six new cookbooks bring stellar dishes--and cultures--from around the world into your kitchen. All products featured on WIRED are independently selected by our editors. However, we may receive compensation from retailers and/or from purchases of products through these links. Think about how difficult cooking from a cookbook from another culture was as little as 10 years ago. Once in a while, you could get your hands on a standout, but the food you could make with it could feel like a compromise with too many substitutions and ingredients you just couldn't find without great effort, or at all.


Embedding-Aligned Language Models Guy Tennenholtz

Neural Information Processing Systems

In this paper, we present a novel framework which accomplishes this by exploiting latent embedding spaces to define an objective function for an LLM in an iterative RL-driven process. As an example, consider the challenge of assisting content creators in generating valuable content within a recommender ecosystem (e.g., Y ouTube, Reddit, Spotify) [Boutilier et al., 2024].


Under Trump, US strikes on Somalia have doubled since last year. Why?

Al Jazeera

Mogadishu, Somalia โ€“ Ending the United States' "forever wars" was a major slogan of Donald Trump's 2024 election campaign, during which he and many of his supporters spoke out against American resources and lives being put to waste in conflicts across the globe. But on February 1, a mere 10 days after being inaugurated for a second time, President Trump announced that the US had carried out air strikes targeting senior leadership of ISIL (ISIS) in Somalia. "These killers, who we found hiding in caves, threatened the United States," his post on X read. This marked Trump's first military action overseas, but it wouldn't be his last. In the time since, the US has provided weapons and support to Israel in its wars in Gaza and across the Middle East; it has launched strikes on Yemen; and even attacked Iran's nuclear facilities.


Detection of Somali-written Fake News and Toxic Messages on the Social Media Using Transformer-based Language Models

arXiv.org Artificial Intelligence

The fact that everyone with a social media account can create and share content, and the increasing public reliance on social media platforms as a news and information source bring about significant challenges such as misinformation, fake news, harmful content, etc. Although human content moderation may be useful to an extent and used by these platforms to flag posted materials, the use of AI models provides a more sustainable, scalable, and effective way to mitigate these harmful contents. However, low-resourced languages such as the Somali language face limitations in AI automation, including scarce annotated training datasets and lack of language models tailored to their unique linguistic characteristics. This paper presents part of our ongoing research work to bridge some of these gaps for the Somali language. In particular, we created two human-annotated social-media-sourced Somali datasets for two downstream applications, fake news \& toxicity classification, and developed a transformer-based monolingual Somali language model (named SomBERTa) -- the first of its kind to the best of our knowledge. SomBERTa is then fine-tuned and evaluated on toxic content, fake news and news topic classification datasets. Comparative evaluation analysis of the proposed model against related multilingual models (e.g., AfriBERTa, AfroXLMR, etc) demonstrated that SomBERTa consistently outperformed these comparators in both fake news and toxic content classification tasks while achieving the best average accuracy (87.99%) across all tasks. This research contributes to Somali NLP by offering a foundational language model and a replicable framework for other low-resource languages, promoting digital and AI inclusivity and linguistic diversity.


Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation

arXiv.org Artificial Intelligence

Retrieval-augmented generation (RAG) has demonstrated effectiveness in mitigating the hallucination problem of large language models (LLMs). However, the difficulty of aligning the retriever with the diverse LLMs' knowledge preferences inevitably poses an inevitable challenge in developing a reliable RAG system. To address this issue, we propose DPA-RAG, a universal framework designed to align diverse knowledge preferences within RAG systems. Specifically, we initially introduce a preference knowledge construction pipline and incorporate five novel query augmentation strategies to alleviate preference data scarcity. Based on preference data, DPA-RAG accomplishes both external and internal preference alignment: 1) It jointly integrate pair-wise, point-wise, and contrastive preference alignment abilities into the reranker, achieving external preference alignment among RAG components. 2) It further introduces a pre-aligned stage before vanilla Supervised Fine-tuning (SFT), enabling LLMs to implicitly capture knowledge aligned with their reasoning preferences, achieving LLMs' internal alignment. Experimental results across four knowledge-intensive QA datasets demonstrate that DPA-RAG outperforms all baselines and seamlessly integrates both black-box and open-sourced LLM readers. Further qualitative analysis and discussions also provide empirical guidance for achieving reliable RAG systems. Our code is publicly available at https://github.com/dongguanting/DPA-RAG.


LLMs for Targeted Sentiment in News Headlines: Exploring the Descriptive-Prescriptive Dilemma

arXiv.org Artificial Intelligence

News headlines often evoke sentiment by intentionally portraying entities in particular ways, making targeted sentiment analysis (TSA) of headlines a worthwhile but difficult task. Due to its subjectivity, creating TSA datasets can involve various annotation paradigms, from descriptive to prescriptive, either encouraging or limiting subjectivity. LLMs are a good fit for TSA due to their broad linguistic and world knowledge and in-context learning abilities, yet their performance depends on prompt design. In this paper, we compare the accuracy of state-of-the-art LLMs and fine-tuned encoder models for TSA of news headlines using descriptive and prescriptive datasets across several languages. Exploring the descriptive--prescriptive continuum, we analyze how performance is affected by prompt prescriptiveness, ranging from plain zero-shot to elaborate few-shot prompts. Finally, we evaluate the ability of LLMs to quantify uncertainty via calibration error and comparison to human label variation. We find that LLMs outperform fine-tuned encoders on descriptive datasets, while calibration and F1-score generally improve with increased prescriptiveness, yet the optimal level varies.


Embedding-Aligned Language Models

arXiv.org Artificial Intelligence

We propose a novel approach for training large language models (LLMs) to adhere to objectives defined within a latent embedding space. Our method leverages reinforcement learning (RL), treating a pre-trained LLM as an environment. Our embedding-aligned guided language (EAGLE) agent is trained to iteratively steer the LLM's generation towards optimal regions of the latent embedding space, w.r.t. some predefined criterion. We demonstrate the effectiveness of the EAGLE agent using the MovieLens 25M dataset to surface content gaps that satisfy latent user demand. We also demonstrate the benefit of using an optimal design of a state-dependent action set to improve EAGLE's efficiency. Our work paves the way for controlled and grounded text generation using LLMs, ensuring consistency with domain-specific knowledge and data representations.


Addis summit raises questions about AU's muted stance on Ethiopia rifts

Al Jazeera

From Thursday, African leaders will gather in the Ethiopian capital, Addis Ababa, home of the African Union (AU), for the continental body's annual summit. According to AU Commission Chairperson Moussa Faki Mahamat, regional integration and "maintaining momentum in addressing issues of peace and security" is high on the agenda. But in an ironic twist, the host of the summit has either initiated or been involved in multiple conflicts in the last three years. Ethiopia's two-year civil war with the state of Tigray may have ended in November 2022 after a Pretoria pact, but federal troops are currently upping drone strikes against rebels known as Fano militia in the state of Amhara, next door to Tigray. This week, the Ethiopian Human Rights Council said "at least 45 civilians" had been killed by federal troops in Amhara.